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b) at least one nucleotide selected from the group consisting of a nucleotide G at positions 



70728, 87860, 88297, 94432, and 95340 of SEQ ID No 1; a nucleotide A at positions 82218, 
83644, 83808, 87787, 87806, 94218, and 97144 of SEQ ID No 1; a nucleotide C at positions 
87902, 88215, 88283, 92760, 93726, and 94422 of SEQ ID No 1; and a nucleotide T at positions 
93903, and 94170 of SEQ ID No 1 ; 

c) a nucleotide G at positions 86435, 93592, 93680, 93681, 93682, 93728, 93761, and 
95445 of SEQ ID No 1; a nucleotide A at positions 86434, 88355, 93240, 93471, and 93747of 
SEQ ID No 1 ; a nucleotide C at positions 93683, 95 1 26, and 95444 of SEQ ID No 1 ; and a 
nucleotide T at positions 94154, and 94430 of SEQ ID No 1; and 

d) a range of nucleotide positions selected from the group consisting of the following 
ranges of nucleotide positions of SEQ ID No 1: 92975-92977, 9371 1-93715, 94151-94153, 
94240-94243, 94770-94773, 94804-94808, 95121-95122, 95129-95135, 95148-95153, 95154- 
95159, 95173-95178, 95367-95374, 95410-95413, 95418-95420, 95430-95436, 95533-95535, 
and 95677-95677. 



^90^An isolated, purified or recombinant polynucleotide comprising a contiguous span of at 
least 12 nucleotides of SEQ ID No 2 or the complementary sequence thereof, wherein said 
contiguous span comprises one of the following: 

a) at least 1 of the nucleotide positions 1-162 of SEQ ID No 2; 

b) at least one nucleotide selected in the group consisting of a nucleotide A at positions 
253, 363, 527, 2471, and 5397 of SEQ ID No 2; a nucleotide C at positions 1013, 1979, and 
2675 of SEQ ID No 2; a nucleotide G at positions 176, 749, 2685, 3593 of SEQ ID No 2; and a 
nucleotide T at positions 2156, and 2423 of SEQ ID No 2; 

c) at least one nucleotide selected in the group consisting of a nucleotide A at positions 
708, 807, 1493, 1724, and 2000; a nucleotide C at positions 1936, 3379, and 3697; a nucleotide 
Gat positions 709, 1845, 1933, 1934, 1935, 1981, 2014, and 3698; and a nucleotide T at 
positions 2407, and 2683 of SEQ ID No 2; and 

d) a range of nucleotide positions selected from the group consisting of the following 
ranges of nucleotide positions of SEQ ID No 2: 1229-1231, 1964-1968, 2404-2406, 2493-2496, 
3023-3026, 3057-3061, 3374-3375, 3382-3388, 3401-3406, 3407-3412, 3426-3431, 3620-3627, 
3663-3666, 3671-3673, 3683-3689, 3786-3788 and 3930-3932. 
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isolated, purified or recombinant polynucleotide comprising a contiguous span of at 
least 12 nucleotides of SEQ ID No 3 or the complementary sequence thereof, wherein said 
contiguous span comprises one of the following: 

a) at least 1 of the following nucleotide positions of SEQ ID No 3: 1=162 and 747-872; 

b) at least one nucleotide selected in the group consisting of a nucleotide A at positions 
253, 363, 527, 2597, and 5523 of SEQ ID No 3; a nucleotide C at positions 1 139, 2105, and 
2801 of SEQ ID No 3; a nucleotide G at positions 176, 875, 281 1, 3719 of SEQ ID No 3; and a 
nucleotide T at positions 2282, and 2549 of SEQ ID No 3; 

c) at least one nucleotide selected in the group consisting of a nucleotide A at positions 
708, 807, 1619, 1850, and 2126; a nucleotide C at positions 2062, 3505, and 3823; a nucleotide 
G at positions 709, 1971, 2059, 2060, 2061, 2107, 2140, and 3824; and a nucleotide T at 
positions 2533, and 2809 of SEQ ID No 3; and 

d) a range of nucleotide positions selected from the group consisting of the following 
ranges of nucleotide positions of SEQ ID No 3: 1355-1357, 1892-1894, 2090-2094, 2530-2532, 
2619-2622, 3149-3152, 3183-3187, 3500-3501, 3508-3514, 3527-3532, 3533-3538, 3552-3557, 
3746-3749, 3789-3792, 3797-3799, 3809-3815, 3912-3914 and 4056-4058. 

f^^kxi isolated, purified or recombinant polynucleotide comprising a contiguous span of at 
least 12 nucleotides of SEQ ID No 4 or the complementary sequence thereof, wherein said 
contiguous span comprises one of the following: 

a) at least 1 of the nucleotide positions 1-162 of SEQ ID No 4; 

b) at least one nucleotide selected in the group consisting of a nucleotide A at positions 
253, 363, 527 and 2460 of SEQ ID No 4; a nucleotide C at position 1013 of SEQ ID No 4 
and a nucleotide G at positions 1 76, and 749 of SEQ ID No 4; 

c) at least one nucleotide selected in the group consisting of a nucleotide A at positions 
708 and 807 and a nucleotide G at position 709 of SEQ ID No 4; and 

d) the pair of nucleotides at positions 1 136-1 137 of SEQ ID No 4. 

^93) An isolated, purified, or recombinant polynucleotide consisting essentially of a 
contiguous span of 8 to 50 nucleotides of anyone of SEQ ID Nos 1, 2, 3, and 4 or the 
complement thereof, wherein said span includes a PCTA-1 -related biallelic marker in said 
sequence. 
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^94^A polynucleotide according to claim 93, wherein said PCTA-1 -related biallelic marker is 
selected from the group consisting of Al to A125, and the complements thereof. 

^95) A polynucleotide according to claim 93, wherein said PCTA-1 -related biallelic marker is 
selected from the group consisting of Al to A44, A46 to A53, A57, A58 5 A62 to A76, A81. A82, 
A86 to A91, A107, Al 18, and A123 to A125, and the complements thereof. 

r 96A A polynucleotide according to claim 93, wherein said PCTA-1 -related biallelic marker is 
selected from the group consisting of A45, A54, A60, A61, A77 to A80, A83 to A85, A93, A102 
to A106, A109, Al 10, Al 14, and A122, and the complements thereof 

f 97 .j A polynucleotide according to claim 93, wherein said PCTA-1 -related biallelic marker is 
selected from the group consisting of A55, A56, A59, A92, A94 to A101, A108, Al 1 1 to Al 13, 
Al 15 to Al 17, and Al 19 to A121, and the complements thereof. 

|^?)A polynucleotide according to claim 93, wherein said contiguous span is 18 to 47 
nucleotides in length and said biallelic marker is within 4 nucleotides of the center of said 
polynucleotide. 

H 99. ) A polynucleotide according to claim 98, wherein said polynucleotide consists of said 
contiguous span and said contiguous span is 25 nucleotides in length and said biallelic marker is 
at the center of said polynucleotide. 

rf 100]\ A polynucleotide according to claim 98, wherein said polynucleotide consists 
essentially of a sequence selected from the following sequences: PI to PI 25, and the 
complementary sequences thereto. 

A 101. JA polynucleotide according to any one of claims 89 to 93, wherein the 3' end of said 
contiguous span is present at the 3 1 end of said polynucleotide. 

f 102. ^A polynucleotide according to claim 93, wherein the 3 f end of said contiguous span is 
located at the 3 f end of said polynucleotide and said biallelic marker is present at the 3 1 end of 
said polynucleotide. 

103. "An isolated, purified, or recombinant polynucleotide consisting essentially of a 
contiguous span of 8 to 50 nucleotides of any one of SEQ ID NOS: 1,2,3,4 or the complement 
thereof, wherein the 3 'end of said contiguous span is located at the 3' end of said polynucleotide, 
and wherein the 3' end of said polynucleotide islocated within 20 nucleotides upstream of a 
PCTA-1 related biallelic marker in said sequence. 



Appl.No. ^/326,402 
Filed _^ : June 4, 1999 

(^V^Sb. polynucleotide according to claim 103, wherein the 3' end of said polynucleotide is 
located 1 nucleotide upstream of said PCTA-1 -related biallelic marker in said sequence. 

polynucleotide according to claim 104, wherein said polynucleotide consists 
essentially of a sequence selected from the following sequences: Dl to D125, and El to E125. 



106. An isolated, purified, or recombinant polynucleotide consisting essentially of a sequence 
selected from the following sequences: Bl to B47 and CI to C47. 

^ 107. ^n isolated, purified, or recombinant polynucleotide which encodes a polypeptide 
comprising a contiguous span of at least 6 amino acids of SEQ ID No 5, wherein said contiguous 
span comprises at least one of the following: 

a) a serine residue at amino acid position 170 of SEQ ID No 5; 

b) a lysine residue at amino acid position 203 of SEQ ID No 5; 

c) a tyrosine residue at amino acid position 18 of SEQ ID No 5; 

d) a cysteine residue at amino acid position 35 of SEQ ID No 5; 

e) a methionine residue at amino acid position 55 of SEQ ID No 5; and 

f) an arginine residue at amino acid position 183 in SEQ ID No 5. 

/lt)8. N An isolated, purified, or recombinant polynucleotide which encodes a polypeptide 

\ . J 

comprising a contiguous span of at least 6 amino acids of SEQ ID No 6, wherein said contiguous 
span comprises at least one of the following: 

a) a serine residue at amino acid position 170 of SEQ ID No 6; 

b) a lysine residue at amino acid position 245 in SEQ ID No 6; 

c) a tyrosine residue at amino acid position 1 8 of SEQ ID No 6; 

d) a cysteine residue at amino acid position 35 of SEQ ID No 6; 

e) a methionine residue at amino acid position 55 of SEQ ID No 6; 

f) an arginine residue at amino acid position 225 in SEQ ID No 6; and 

g) at least 1 of amino acid positions 183-224 of the SEQ ID No 6. 

^^^n isolated, purified, or recombinant polynucleotide which encodes a polypeptide 
comprising a contiguous span of at least 6 amino acids of SEQ ID No 7, wherein said contiguous 
span comprises at least one of the following: 

a) a serine residue at amino acid position 170 of SEQ ID No 7; 

b) a lysine residue at amino acid position 203 of SEQ ID No 7; 

c) tyrosine residue at amino acid position 1 8 of SEQ ID No 7; 

-5- 
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d) a cysteine residue at amino acid position 35 of SEQ ID No 7; 

e) a methionine residue at amino acid position 55 of SEQ ID No 7; 

f) an arginine residue at amino acid position 183 in SEQ ID No 7; and 

g) at least 1 of amino acid positions 313-368 of the SEQ ID No 7. 

1 10. An isolated, purified, or recombinant polynucleotide comprising a contiguous span of at 
least 12 nucleotides of SEQ ID No 8 or the complements thereof, wherein said contiguous span 
comprises at least 1 of the following nucleotide positions of SEQ ID No 8: 1-500, 501-1000, 
1001-1500, and 1501-1738. 

1 1 1. An isolated, purified, or recombinant polynucleotide which encodes a polypeptide 
comprising a contiguous span of at least 6 amino acids of SEQ ID No 9, wherein said contiguous 
span comprises at least 1 of the following amino acid positions of SEQ ID No 9: 1-50, 51-100, 
101-150, 151-200, 201-250, and 251-316. 

1 12. A polynucleotide according to any one of claims 89 to 93, 100, 105 and 106 attached to 
a solid support. 

1 13. An array of polynucleotides comprising at least one polynucleotide according to claim 
112. 

1 14. An array according to claim 113, wherein said array is addressable. 

1 15. A polynucleotide according to any one of claims 89 to 93, 100, 105 and 106 further 
comprising a label. 

1 16. A recombinant vector comprising a polynucleotide according to any one of claims 89 to 
92, and 107 to 111. 

1 1 7. A host cell comprising a recombinant vector according to claim 1 1 6. 

118. A non-human host animal or mammal comprising a recombinant vector according to 
claim 116. 

1 19. A mammalian host cell comprising a PCTA-1 gene disrupted by homologous 
recombination with a knock out vector, comprising a polynucleotide according to any one of 
claims 89 to 92, and 107 to 1 1 1 . 

120. A non-human host mammal comprising a PCTA-1 gene disrupted by homologous 
recombination with a knock out vector, comprising a polynucleotide according to any one of 
claims 89 to 92, and 1 07 to 1 1 1 . 
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^ 1 2 1 . A method of genotymng comprising determining the identity of a nucleotide at a 

PCTA-1 -related biallelic markq- or the complement thereof in a biological sample. 

122. A method according to claim 121, wherein said biological sample is derived from a 
single subject. 

123. A method according to claim 122, wherein the identity of the nucleotides at said 
biallelic marker is determined for both copies of said biallelic marker present in said individual's 
genome. 

124. A method according to claim 121, wherein said biological sample is derived from 
multiple subjects. 

125. A method according to claim 121, further comprising amplifying a portion of said 
sequence comprising the biallelic marker prior to said determining step. 

126. A method according to claim 125, wherein said amplifying is performed by PCR. 

127. A method according to claim 121, wherein said determining is performed by a 
hybridization assay. 

128. A method according to claim 121, wherein said determining is performed by a 
sequencing assay. 

129. A method according to claim 121, wherein said determining is performed by a 
microsequencing assay. 

130. A method according to claim 121, wherein said determining is performed by an 
enzyme-based mismatch detection assay. 

131. A methou of estimating the frequency of an allele of a PCTA- 1 -related biallelic marker 
in a population comprising: 

a) genotyping individuals from said population for said biallelic marker according to the 
method of claim 12lV and 

b) determining the proportional representation of said biallelic marker in said population.. 

132. A method of detecting an association between a genotype and a trait, comprising the 
steps of: 

a) determining the frequency of at least one PCTA-1 -related biallelic marker in trait positive 
population according to the method of claim 131 ; 

b) determining the frequency of at least one PCTA-1 -related biallelic marker in a control 
population according to the method of claim 131; and 
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c) determining whether a statistically significant association exists between said genotype 

and said trait. 

133. A method of estimating the frequency of a haplotype for a set of biallelic markers in a 
population, comprising: 

a) genotyping at least one PCTA-1 -related biallelic marker according to claim 122 for each 
individual in said population; 

b) genotyping a second biallelic marker by determining the identity of the nucleotides at said 
second biallelic marker for both copies of said second biallelic marker present in the genome of 
each individual in said population; and 

c) applying a haplotype determination method to the identities of the nucleotides determined 
in steps a) and b) to obtain an estimate of said frequency. 

134. A method according to claim 133, wherein said haplotype determination method is 
selected from the group consisting of asymmetric PCR amplification, double PCR amplification 
of specific alleles, the Clark algorithm, or an expectation-maximization algorithm. 

135. A method of detecting an association between a haplotype and a trait, comprising the 
steps of: 

a) estimating the frequency of at least one haplotype in a trait positive population according 
to the method of claim 133; 

b) estimating the frequency of said haplotype in a control population according to the method 
of claim 133; and 

c) determining whether a statistically significant association exists between said haplotype 
and said trait. 

136. A method according to claim 132, wherein said genotyping steps a) and b) are 
performed on a single pooled biological sample derived from each of said populations. 

137. A method according to claim 132, wherein said genotyping steps a) and b) performed 
separately on biological samples derived from each individual in said populations. 

a £p^^ 138. A method according to eitAer claim 132 or 135, wherein said trait is cancer, prostate 
^ cancer, an early onset of prostate cancer, a beneficial response to or side effects related to 

treatment or a vaccination against prostate cancer, a susceptibility to prostate cancer, the level of 
aggressiveness of prostate cancer tumors, a modified or forthcoming expression of the PCTA-1 
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gene, a modified on forthcoming production of the PCTA-1 protein, or the production of a 
modified PCTA-1 protein. 

139. A method according to claim 138 wherein said trait is prostate cancer. 
^>140. A melhod according to either claim 132 or 135, wherein said control population is a 
trait negative population. 

141. A method according to either claim 132 or 135, wherein said case control population is 
a random population. 

142. An isolated, purified, or recombinant polypeptide comprising a contiguous span of at 
least 6 amino acids of SEQ ID No 5, wherein said contiguous span comprises at least one of the 
following: 

a) a serine residue at amino acid position 170 of SEQ ID No 5; 

b) a lysine residue at amino acid position 203 of SEQ ID No 5; 

c) a tyrosine residue at amino acid position 18 of SEQ ID No 5; 

d) a cysteine residue at amino acid position 35 of SEQ ID No 5; 

e) a methionine residue at amino acid position 55 of SEQ ID No 5; and 

f) an arginine residue at amino acid position 183 in SEQ ID No 5. 

143. An isolated, purified, or recombinant polypeptide comprising a contiguous span of at 
least 6 amino acids of SEQ ID No 6, wherein said contiguous span comprises at least one of the 
following: 

a) a serine residue at amino acid position 170 of SEQ ID No 6; 

b) a lysine residue at amino acid position 245 in SEQ ID No 6; 

c) a tyrosine residue at amino acid position 18 of SEQ ID No 6; 

d) a cysteine residue at amino acid position 35 of SEQ ID No 6; 

e) a methionine residue at amino acid position 55 of SEQ ID No 6; 

f) an arginine residue at amino acid position 225 in SEQ ID No 6; and 

g) at least 1 of amino acid positions 183-224 of the SEQ ID No 6. 

144. An isolated, purified, or recombinant polypeptide comprising a contiguous span of at 
least 6 amino acids of SEQ ID No 7, wherein said contiguous span comprises at least one of the 
following: 

a) a serine residue at amino acid position 170 of SEQ ID No 7; 

b) a lysine residue at amino acid position 203 of SEQ ID No 7; 
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c) tyrosine residue at amino acid position 18 of SEQ ID No 7; 

d) a cysteine residue at amino acid position 35 of SEQ ID No 7; 

e) a methionine residue at amino acid position 55 of SEQ ID No 7; 

f) an arginine residue at amino acid position 183 in SEQ ID No 7; and 

g) at least 1 of amino acid positions 313-368 of the SEQ ID No 7. 

145. An isolated, purified, or recombinant polypeptide comprising a contiguous span of at 
least 6 amino acids of SEQ ID No 9, wherein said contiguous span comprises at least 1 of the 
following amino acid positions of SEQ ID No 9: 1-50, 51-100, 101-150, 151-200, 201-250, and 
251-316. 

146. An isolated or purified antibody composition are capable of selectively binding to an 
epitope-containing fragment of a polypeptide according to claim 142, wherein said epitope 
comprises at least one of the following: 

a) a serine residue at amino acid position 170 of SEQ ID No 5; 

b) a lysine residue at amino acid position 203 of SEQ ID No 5; 

c) a tyrosine residue at amino acid position 18 of SEQ ID No 5; 

d) a cysteine residue at amino acid position 35 of SEQ ID No 5; 

e) a methionine residue at amino acid position 55 of SEQ ID No 5; and 

f) an arginine residue at amino acid position 183 in SEQ ID No 5. 

147. An isolated or purified antibody composition are capable of selectively binding to an 
epitope-containing fragment of a polypeptide according to claim 143, wherein said epitope 
comprises at least one of the following: 

a) a serine residue at amino acid position 170 of SEQ ID No 6; 

b) a lysine residue at amino acid position 245 in SEQ ID No 6; 

c) a tyrosine residue at amino acid position 1 8 of SEQ ID No 6; 

d) a cysteine residue at amino acid position 35 of SEQ ID No 6; 

e) a methionine residue at amino acid position 55 of SEQ ID No 6; 

f) an arginine residue at amino acid position 225 in SEQ ID No 6; and 

g) at least 1 of amino acid positions 183-224 of the SEQ ID No 6. 

148. An isolated or purified antibody composition are capable of selectively binding to an 
epitope-containing fragment of a polypeptide according to claim 144, wherein said epitope 
comprises at least one of the following: 



-10- 
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a) a serine residue at amino acid position 170 of SEQ ID No 7; 

b) a lysine residue at amino acid position 203 of SEQ ID No 7; 

c) tyrosine residue at amino acid position 1 8 of SEQ ID No 7; 

d) a cysteine residue at amino acid position 35 of SEQ ID No 7; 

e) a methionine residue at amino acid position 55 of SEQ ID No 7; 

f) an arginine residue at amino acid position 1 83 in SEQ ID No 7; and 

g) at least 1 of amino acid positions 313-368 of the SEQ ID No 7. 

149. An isolated or purified antibody composition are capable of selectively binding to an 
epitope-containing fragment of a polypeptide according to claim 145, wherein said epitope 
comprises at least 1 of the following amino acid positions of SEQ ID No 9: 1-50, 51-100, 101- 
150, 151-200, 201-250, and 251-316. 

150. A methoasof determining whether an individual is at risk of developing prostate cancer, 
comprising: \. 



a) genotyping at least ohe PCTA-1 -related biallelic marker according to the method of claim 
123; and \ 

b) correlating the result of step\a) with a risk of developing prostate cancer. 

151. A method according to anvMe of claims 121, 131, 132, 133, 135, and 150 wherein said 
PCTA-1 -related biallelic marjteris selected from the group consisting of Al to A125 and the 
complements thereof/^ 

152. A metljadaccording to claim 150, wherein said PCTA-1 -related biallelic marker is 
selected pem the following list of biallelic markers: A2, A30, A41, A55 and A57, and the 
cpinf)lements thereof. 

153. A diagnostic kit comprising a polynucleotide according to any one of claims 93, 100, 
105, and 106. 

154. A diagnostic kit comprising a polynucleotide according to claim 112. 

155. A diagnostic kit comprising a polynucleotide according to claim 115. 

156. A computer readable medium having stored thereon a sequence selected from the group 
consisting of a nucleic acid code comprising one of the following: 

a) a contiguous span of at least 12 nucleotides of SEQ ID No 1, wherein said contiguous span 
comprises at least 1 of the following nucleotide positions of SEQ ID No 1: 1-70715, 70795- 
82207, 82297-83612, 83824-85297, 85418-86388, 86446-87495, 87523-88294, 88384-89483, 
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89650-92748, 97156-98309, 98476-99329, 99491-100026, 100212-100281, 100396-100538, 
100682-100833, 100995-101920, 102087-102970, 103264-103724, and 103753-106746; 

b) a contiguous span of at least 12 nucleotides of SEQ ID No 1, wherein said contiguous 
span comprises at least one nucleotide selected from the group consisting of a nucleotide G at 
positions 70728, 87860, 88297, 94432, and 95340 of SEQ ID No 1; a nucleotide A at positions 
82218, 83644, 83808, 87787, 87806, 94218, and 97144 of SEQ ID No 1; a nucleotide C at 
positions 87902, 8821 5, 88283, 92760, 93726, and 94422 of SEQ ID No 1 ; and a nucleotide T at 
positions 93903, and 94170 of SEQ ID No 1; 

c) a contiguous span of at least 12 nucleotides of SEQ ID No 1, wherein said contiguous 
span comprises at least one nucleotide selected from the group consisting of a nucleotide G at 
positions 86435, 93592, 93680, 93681, 93682, 93728, 93761, and 95445 of SEQ ID No 1; a 
nucleotide A at positions 86434, 88355, 93240, 93471, and 93747of SEQ ID No 1; a nucleotide 
C at positions 93683, 95126, and 95444 of SEQ ID No 1; and a nucleotide T at positions 94154, 
and 94430 of SEQ ID No 1; 

d) a contiguous span of at least 12 nucleotides of SEQ ID No 1, wherein said contiguous 
span comprises nucleotide positions selected from the group consisting of the nucleotide 
positions of SEQ ID No 1: 92975-92977, 93711-93715, 94151-94153, 94240-94243, 94770- 
94773, 94804-94808, 95121-95122, 95129-95135, 95148-95153, 95154-95159, 95173-95178, 
95367-95374, 95410-95413, 95418-95420, 95430-95436, 95533-95535, and 95677-95677; 

e) a contiguous span of at least 12 nucleotides of SEQ ID No 2, wherein said contiguous 
span comprises at least lof the nucleotide positions 1-162 of SEQ ID No 2; 

f) a contiguous span of at least 12 nucleotides of SEQ ID No 2, wherein said contiguous 
span comprises at least one nucleotide selected in the group consisting of a nucleotide A at 
positions 253, 363, 527, 2471, and 5397 of SEQ ID No 2; a nucleotide C at positions 1013, 
1979, and 2675 of SEQ ID No 2; a nucleotide G at positions 176, 749, 2685, 3593 of SEQ ID 
No 2; and a nucleotide T at positions 2156, and 2423 of SEQ ID No 2; 

g) a contiguous span of at least 12 nucleotides of SEQ ID No 2, wherein said contiguous 
span comprises at least one nucleotide selected in the group consisting of a nucleotide A at 
positions 708, 807, 1493, 1724, and 2000; a nucleotide C at positions 1936, 3379, and 3697; a 
nucleotide G at positions 709, 1845, 1933, 1934, 1935, 1981, 2014, and 3698; and a nucleotide 
T at positions 2407, and 2683 of SEQ ID No 2; 

-12- 
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h) a contiguous span of at least 12 nucleotides of SEQ ID No 2, wherein said contiguous 
span comprises nucleotide positions selected from the group consisting of the nucleotide 
positions of SEQ ID No 2: 1229-1231, 1964-1968, 2404-2406, 2493-2496, 3023-3026, 3057- 
3061, 3374-3375, 3382-3388, 3401-3406, 3407-3412, 3426-3431, 3620-3627, 3663-3666, 3671- 
3673, 3683-3689, 3786-3788 and 3930-3932; 

i) a contiguous span of at least 12 nucleotides of SEQ ID No 3, wherein said contiguous span 
comprises at least 1 of the following nucleotide positions of SEQ ID No 3: 1-162 and 747-872; 

j) a contiguous span of at least 12 nucleotides of SEQ ID No 3, wherein said contiguous span 
comprises at least one nucleotide selected in the group consisting of a nucleotide A at positions 
253, 363, 527, 2597, and 5523 of SEQ ID No 3; a nucleotide C at positions 1 139, 2105, and 
2801 of SEQ ID No 3; a nucleotide G at positions 176, 875, 281 1, 3719 of SEQ ID No 3; and a 
nucleotide T at positions 2282, and 2549 of SEQ ID No 3; 

k) a contiguous span of at least 12 nucleotides of SEQ ID No 3, wherein said contiguous span 
comprises at least one nucleotide selected in the group consisting of a nucleotide A at positions 
708, 807, 1619, 1850, and 2126; a nucleotide C at positions 2062, 3505, and 3823; a nucleotide 
G at positions 709, 1971, 2059, 2060, 2061, 2107, 2140, and 3824; and a nucleotide T at 
positions 2533, and 2809 of SEQ ID No 3; 

1) a contiguous span of at least 12 nucleotides of SEQ ID No 3, wherein said contiguous span 
comprises nucleotide positions selected from the group consisting of the nucleotide positions of 
SEQ ID No 3: 1355-1357, 1892-1894, 2090-2094, 2530-2532, 2619-2622, 3149-3152, 3183- 
3187, 3500-3501, 3508-3514, 3527-3532, 3533-3538, 3552-3557, 3746-3749, 3789-3792, 3797- 
3799, 3809-3815, 3912-3914 and 4056-4058; 

m) a contiguous span of at least 12 nucleotides of SEQ ID No 4, wherein said contiguous 
span comprises at least 1 of the nucleotide positions 1-162 of SEQ ID No 4; 

n) a contiguous span of at least 12 nucleotides of SEQ ID No 4, wherein said contiguous 
span comprises at least one nucleotide selected in the group consisting of a nucleotide A at 
positions 253, 363, 527 and 2460 of SEQ ID No 4; a nucleotide C at position 101 3 of SEQ ID 
No 4 and a nucleotide G at positions 176, and 749 of SEQ ID No 4; 

o) a contiguous span of at least 12 nucleotides of SEQ ID No 4, wherein said contiguous 
span comprises at least one nucleotide selected in the group consisting of a nucleotide A at 
positions 708 and 807 and a nucleotide G at position 709 of SEQ ID No 4; 

-13- 
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p) a contiguous span of at least 12 nucleotides of SEQ ID No 4, wherein said contiguous 
span comprises the pairs of nucleotide positions 1 136-1 137 of SEQ ID No 4; 

q) a contiguous span of at least 12 nucleotides of SEQ ID No 8, wherein said contiguous 
span comprises at least 1 of the following nucleotide positions of SEQ ID No 8: 1-500, 501- 
1000, 1001-1500, and 1501-1738; and, _ 

r) aur^ complementary to any one of the nucleotide sequences,pf a) to q). 

157. A computer readable medium having stored thereon a sequence consisting of a 
polypeptide code comprising at least one of the following: 

a) a contiguous span of at least 6 amino acids of SEQ ID No 5, wherein said contiguous 
span comprises at least one of the following: 

i) a serine residue at amino acid position 170 of SEQ ID No 5; 

ii) a lysine residue at amino acid position 203 of SEQ ID No 5; 

iii) a tyrosine residue at amino acid position 1 8 of SEQ ID No 5; 

iv) a cysteine residue at amino acid position 35 of SEQ ID No 5; 

v) a methionine residue at amino acid position 55 of SEQ ID No 5; and 

vi) an arginine residue at amino acid position 1 83 in SEQ ID No 5; 

b) a contiguous span of at least 6 amino acids of SEQ ID No 6, wherein said contiguous 
span comprises at least one of the following: 

i) a serine residue at amino acid position 1 70 of SEQ ID No 6; 

ii) a lysine residue at amino acid position 245 in SEQ ID No 6; 

iii) a tyrosine residue at amino acid position 18 of SEQ ID No 6; 

iv) a cysteine residue at amino acid position 35 of SEQ ID No 6; 

v) a methionine residue at amino acid position 55 of SEQ ID No 6; 

vi) an arginine residue at amino acid position 225 in SEQ ID No 6; and 

vii) at least 1 of amino acid positions 183-224 of the SEQ ID No 6; 

c) a contiguous span of at least 6 amino acids of SEQ ID No 7, wherein said contiguous 
span comprises at least one of the following: 

i) a serine residue at amino acid position 1 70 of SEQ ID No 7; 

ii) a lysine residue at amino acid position 203 of SEQ ID No 7; 

iii) tyrosine residue at amino acid position 18 of SEQ ID No 7; 

iv) a cysteine residue at amino acid position 35 of SEQ ID No 7; 
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v) a methionine residue at amino acid position 55 of SEQ ID No 7; 

vi) an arginine residue at amino acid position 183 in SEQ ID No 7; and 

vii) at least 1 of amino acid positions 313-368 of the SEQ ID No 7; and 
d) a contiguous span of at least 6 amino acids of SEQ ID No 9. 

158. A computer system comprising a processor and a data storage device wherein said data 
storage device a computer readable medium according to claim 156 or 157. 

159. A computer system according to claim 158, further comprising a sequence comparer 
and a data storage device having reference sequences stored thereon. 

160. A computer system of Claim 159 wherein said sequence comparer comprises a 
computer program which indicates polymorphisms. 

161. A computer system of Claim 158 further comprising an identifier which identifies 
features4n said sequence. 

162. A method for comparing a first sequence to a reference sequence, comprising the steps 

of: 

reading said first sequence and said reference sequence through use of a computer program 
which compares sequences; and 

determining differences between said first sequence and said reference sequence with said 
computer program, 

wherein said first sequence is selected from the group consisting of a nucleic acid code 
comprising one of the following: 

a) a contiguous span of at least 12 nucleotides of SEQ ID No 1, wherein said contiguous 
span comprises at least 1 of the following nucleotide positions of SEQ ID No 1 : 1-70715, 70795- 
82207, 82297-83612, 83824-85297, 85418-86388, 86446-87495, 87523-88294, 88384-89483, 
89650-92748, 97156-98309, 98476-99329, 99491-100026, 100212-100281, 100396-100538, 
100682-100833, 100995-101920, 102087-102970, 103264-103724, and 103753-106746; 

b) a contiguous span of at least 12 nucleotides of SEQ ID No 1, wherein said contiguous 
span comprises at least one nucleotide selected from the group consisting of a nucleotide G at 
positions 70728, 87860, 88297, 94432, and 95340 of SEQ ID No 1 ; a nucleotide A at positions 
82218, 83644, 83808, 87787, 87806, 94218, and 97144 of SEQ ID No 1; a nucleotide C at 
positions 87902, 88215, 88283, 92760, 93726, and 94422 of SEQ ID No 1; and a nucleotide T at 
positions 93903, and 94170 of SEQ ID No 1; 
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c) a contiguous span of at least 12 nucleotides of SEQ ID No 1, wherein said contiguous 
span comprises at least one nucleotide selected from the group consisting of a nucleotide G at 
positions 86435, 93592, 93680, 93681, 93682, 93728, 93761, and 95445 of SEQ ID No 1; a 
nucleotide A at positions 86434, 88355, 93240, 93471, and 93747of SEQ ID No 1; a nucleotide 
C at positions 93683, 95126, and 95444 of SEQ ID No 1 ; and a nucleotide T at positions 94154, 
and 94430 of SEQ IDNol; 

d) a contiguous span of at least 12 nucleotides of SEQ ID No 1, wherein said contiguous 
span comprises nucleotide positions selected from the group consisting of the nucleotide 
positions of SEQ ID No 1: 92975-92977, 9371 1-93715, 94151-94153, 94240-94243, 94770- 
94773, 94804-94808, 95121-95122, 95129-95135, 95148-95153, 95154-95159, 95173-95178, 
95367-95374, 95410-95413, 95418-95420, 95430-95436, 95533-95535, and 95677-95677; 

e) a contiguous span of at least 12 nucleotides of SEQ ID No 2, wherein said contiguous 
span comprises at least lof the nucleotide positions 1-162 of SEQ ID No 2; 

f) a contiguous span of at least 12 nucleotides of SEQ ID No 2, wherein said contiguous 
span comprises at least one nucleotide selected in the group consisting of a nucleotide A at 
positions 253, 363, 527, 2471, and 5397 of SEQ ID No 2; a nucleotide C at positions 1013, 
1979, and 2675 of SEQ ID No 2; a nucleotide G at positions 176, 749, 2685, 3593 of SEQ ID 
No 2; and a nucleotide T at positions 21 56, and 2423 of SEQ ID No 2; 

g) a contiguous span of at least 12 nucleotides of SEQ ID No 2, wherein said contiguous 
span comprises at least one nucleotide selected in the group consisting of a nucleotide A at 
positions 708, 807, 1493, 1724, and 2000; a nucleotide C at positions 1936, 3379, and 3697; a 
nucleotide G at positions 709, 1845, 1933, 1934, 1935, 1981, 2014, and 3698; and a nucleotide 
T at positions 2407, and 2683 of SEQ ID No 2; 

h) a contiguous span of at least 12 nucleotides of SEQ ID No 2, wherein said contiguous 
span comprises nucleotide positions selected from the group consisting of the nucleotide 
positions of SEQ ID No 2: 1229-1231, 1964-1968, 2404-2406, 2493-2496, 3023-3026, 3057- 
3061, 3374-3375, 3382-3388, 3401-3406, 3407-3412, 3426-3431, 3620-3627, 3663-3666, 3671- 
3673, 3683-3689, 3786-3788 and 3930-3932; 

i) a contiguous span of at least 12 nucleotides of SEQ ID No 3, wherein said contiguous 
span comprises at least 1 of the following nucleotide positions of SEQ ID No 3: 1-162 and 747- 
872; 
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j) a contiguous span of at least 12 nucleotides of SEQ ID No 3, wherein said contiguous 
span comprises at least one nucleotide selected in the group consisting of a nucleotide A at 
positions 253, 363 5 527, 2597, and 5523 of SEQ ID No 3; a nucleotide C at positions 1 139, 
2105, and 2801 of SEQ ID No 3; a nucleotide G at positions 176, 875, 2811, 3719 of SEQ ID 
No 3; and a nucleotide T at positions 2282, and 2549 of SEQ ID No 3; 

k) a contiguous span of at least 12 nucleotides of SEQ ID No 3, wherein said contiguous 
span comprises at least one nucleotide selected in the group consisting of a nucleotide A at 
positions 708, 807, 1619, 1850, and 2126; a nucleotide C at positions 2062, 3505, and 3823; a 
nucleotide G at positions 709, 1971, 2059, 2060, 2061, 2107, 2140, and 3824; and a nucleotide 
T at positions 2533, and 2809 of SEQ ID No 3; 

1) a contiguous span of at least 12 nucleotides of SEQ ID No 3, wherein said contiguous 
span comprises nucleotide positions selected from the group consisting of the nucleotide 
positions of SEQ ID No 3: 1355-1357, 1892-1894, 2090-2094, 2530-2532, 2619-2622, 3149- 
3152, 3183-3187, 3500-3501, 3508-3514, 3527-3532, 3533-3538, 3552-3557, 3746-3749, 3789- 
3792, 3797-3799, 3809-3815, 3912-3914 and 4056-4058; 

m) a contiguous span of at least 12 nucleotides of SEQ ID No 4, wherein said contiguous 
span comprises at least 1 of the nucleotide positions 1-162 of SEQ ID No 4; 

n) a contiguous span of at least 12 nucleotides of SEQ ID No 4, wherein said contiguous 
span comprises at least one nucleotide selected in the group consisting of a nucleotide A at 
positions 253, 363, 527 and 2460 of SEQ ID No 4; a nucleotide C at position 1013 of SEQ ID 
No 4 and a nucleotide G at positions 176, and 749 of SEQ ID No 4; 

o) a contiguous span of at least 12 nucleotides of SEQ ID No 4, wherein said contiguous 
span comprises at least one nucleotide selected in the group consisting of a nucleotide A at 
positions 708 and 807 and a nucleotide G at position 709 of SEQ ID No 4; 

p) a contiguous span of at least 12 nucleotides of SEQ ID No 4, wherein said contiguous 
span comprises the pairs of nucleotide positions 1 136-1 137 of SEQ ID No 4; 

q) a contiguous span of at least 12 nucleotides of SEQ ID No 8, wherein said contiguous 
span comprises at least 1 of the following nucleotide positions of SEQ ID No 8: 1-500, 501- 
1000, 1001-1500, and 1501-1738; 

r) a nucleotide sequence complementary to any one of the nucleotide sequences of a) to q); 
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s) a contiguous span of at least 6 amino acids of SEQ ID No 5, wherein said contiguous 
span comprises at least one of the following: 

i) a serine residue at amino acid position 170 of SEQ ID No 5; 

ii) a lysine residue at amino acid position 203 of SEQ ID No 5; 

iii) a tyrosine residue at amino acid position 18 of SEQ ID No 5; 

iv) a cysteine residue at amino acid position 35 of SEQ ID No 5; 

v) a methionine residue at amino acid position 55 of SEQ ID No 5; and 

vi) an arginine residue at amino acid position 183 in SEQ ID No 5; 

$ t) a contiguous span of at least 6 amino acids of SEQ ID No 6, wherein said contiguous 

^ span comprises at least one of the following: 

^ i) a serine residue at amino acid position 170 of SEQ ID No 6; 

ii) a lysine residue at amino acid position 245 in SEQ ID No 6; 

iii) a tyrosine residue at amino acid position 18 of SEQ ID No 6; 

iv) a cysteine residue at amino acid position 35 of SEQ ID No 6; 

v) a methionine residue at amino acid position 55 of SEQ ID No 6; 

vi) an arginine residue at amino acid position 225 in SEQ ID No 6; and 

vii) at least 1 of amino acid positions 183-224 of the SEQ ID No 6; 
u) a contiguous span of at least 6 amino acids of SEQ ID No 7, wherein said contiguous 

span comprises at least one of the following: 

i) a serine residue at amino acid position 170 of SEQ ID No 7; 

ii) a lysine residue at amino acid position 203 of SEQ ID No 7; 

iii) tyrosine residue at amino acid position 18 of SEQ ID No 7; 

iv) a cysteine residue at amino acid position 35 of SEQ ID No 7; 

v) a methionine residue at amino acid position 55 of SEQ ID No 7; 

vi) an arginine residue at amino acid position 183 in SEQ ID No 7; and 

vii) at least 1 of amino acid positions 313-368 of the SEQ ID No 7; and 
v) a contiguous span of at least 6 amino acids of SEQ ID No 9. 



Remarks 

The Examiner is respectfully requested to direct questions concerning this application to 
the undersigned attorney at the telephone number appearing below. 
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